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© Screening methods for protease Inhibitors. 



© The invention provides methods to screen for inhibitors of proteases. The first method involves the 
construction of a plasmld system such that expression of a protease is toxic to a bacterial strain. Compounds of 
interest may then be screened for those which inhibit the toxic protease and thereby prevent killing of the 
bacterial strain. Inhibition is detected by successful transformations of the bacterial strain by the plasmid coding 
for the protease. The second method involves the construction of a fusion protein which contains a protease 
cleavage site inserted into a protein whose activity may be assayed. Compounds are screened for protease 
inhibition, which is detected by assaying for a change in color, light or activity of the protein. HIV protease and 
polio 3C protease are examples of proteases that may be used in these methods. 
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SCREENING METHODS FOR PROTEASE INHIBITORS 



BACKGROUND OF THE INVENTION 



Proteases are a class of proteins some of whose members have been implicated in human diseases. 

5 For example, human renin is an aspartyl protease which has been implicated in hypertension through its 
role in the formation in plasma of the vasoactive peptide angiotensin (Bibliography entry 1). The proteases 
trypsin and elastase are involved in the onset of emphysema (2,3). Of particular interest is the protease 
associated with the human immunodeficiency virus ("HIV"). Polio protease is also of interest. 

The primary translation products of HIV include polyprotein precursors encoded by the gag and pol 

io genes (4). The gag gene encodes the capsid proteins; the pol gene encodes the viral enzymesTcomprising 
a 10 kd viral protease, a 66 kd reverse transcriptase and a 34 kd endonuclease (5). The production of these 
proteins is dependent upon the cleavage of these polyproteins by the viral protease, which is generated by 
autodigestion of the pol polyprotein. 

As a viral-specific enzyme, the protease is an attractive target for drugs for the inhibition of viral 

75 replication, and has become the focus of intensive investigation. The pol sequence (nucleotides 1833-2129) 
produces a mature 99 amino acid protease (6), with proline at the amino terminus and phenylalanine at the 
carbaxy-terminus (7; Graves). The 99 amino acid protease is homologous to the aspartlc protease family 
(8), and is inhibited by pepstatin A (9). Recombinant protease mutated at the catalytic Asp 25 residue is 
enzymaticaily inactive in bacterial expression systems and in vitro (5,6). and virus mutated at this site is 

20 non-infectious (6). The protease has recently been purified from E. coli and crystallized (10). 

At present, there Is an urgent need to develop therapeutic agenti against acquired immune deficiency 
syndrome ("AIDS"), which is believed to be caused by HIV. An agent which is selective for HIV protease 
might serve to inhibit HIV replication and thus be efficacious in the treatment of AIDS. There is thus a need 
for methods to screen for such inhibitors of HIV protease. 

25 Existing methods for assaying HIV protease utilize polyprotein or oligopeptide substrates, and they 
require gel electrophoresis or high performance liquid chromatography to ascertain the presence of 
cleavage products (11-16). Other methods recently developed for multiple samples include resonance 
energy transfer (17) and release of radioactive products (18). These methods are not easily adapted for 
assaying protease activity in large numbers of samples. 

30 Accordingly, it is an object of this invention to develop screening methods for identifying inhibitors of 
proteases. 

It is a particular object of this invention to develop screening methods for identifying inhibitors of HIV 
protease. 

It is a further object of this invention to develop screening methods for identifying inhibitors of polio 



The present invention provides methods for screening for compounds which inhibit protease enzymes. 
In one embodiment of the invention, the method is based first upon the construction of a plasmid having a 
DNA sequence which encodes a protease under the control of a T7 promoter. Next, a bacterial expression 
strain is selected which is not transformed by the plasmid because of the toxicity of the protease to the 
45 bacterial strain. A compound to be screened for inhibitory activity is then added together with the plasmid to 
the bacterial strain. If the bacterial strain is not killed and produces transformants, the compound is 
identified as an inhibitor of the protease. 

In particular, the first embodiment of the present invention provides plasmid constructs designated 
pPDB16 and pMae2 which contain a DNA sequence coding for HIV protease. These piasmids do not yield 
so transformants in an E. coli strain designated BL21(DE3). Thus, expression of HIV protease from the plasmid 
appears to be toxic to this E. coli strain. This discovery suggests a method for identifying agents which 
inhibit HIV protease. A similar plasmid construct, designated pPolio protease 3C, is described for polio 
protease, which may be used to screen for inhibitors of polio protease. 

In another aspect of the first embodiment of the invention, a method for screening for compounds which 
inhibit HIV protease is described. A plasmid is constructed having a DNA sequence which encodes HIV 



35 protease. 
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protease under the control of a T7 promoter. Next, a bacterial strain is selected which is transformed by the 
plasmid and is then killed by the induction of protease synthesis by an inducing agent. A compound to be 
screened for inhibitory activity is then added together with the inducing agent to the transformed bacterial 
strain. If the induced cells are not killed, the compound is identified as an inhibitor of HIV protease. 

5 In the second embodiment of the invention, the method involves the selection of a protein which is 
capable of being assayed for a change in color, light or activity when combined with a protease enzyme in 
the presence or absence of a potential inhibitor compound. 

Next, a heterologous DNA sequence coding for a peptide which includes a protease enzyme cleavage 
site is inserted into the gene coding for the selected protein. The fused gene is inserted into a suitable 

io expression system and is combined with the protease enzyme which recognizes and cleaves the peptide 
coded for by the heterologous DNA sequence embedded within the fusion protein, such that the protease 
enzyme cleaves at the protease enzyme cleavage site, which in turn inactivates the expressed fusion 
protein. 

This loss of activity is monitored by means of a color change, light change or by directly monitoring the 
15 loss of activity. A compound to be screened is included in the assay system and is identified as an inhibitor 
of the protease enzyme if the protein is not cleaved and remains active when monitored by any of the 
methods described previously. 

In particular, the second embodiment of this invention provides a plasmid construct designated 
p1 +IQSau, which includes the gene for l-galactosidase, into which is inserted a DNA sequence coding for 
20 a decapeptide HIV protease cleavage site. When expressed, this fusion protein retains £-gafactosidase 
activity. HIV protease cleaves and inactivates the fusion protein. 

Potentially inhibitory compounds of HIV protease are screened by monitoring 0-galactosidase activity, 
such that maintenance of 0-galactosidase activity indicates that the compound prevents cleavage and 
inhibits HIV protease. 

25 A similar construct, designated plasmid pZM3CSau, includes the gene coding for l-galactosidase into 
which is inserted a DNA sequence coding for a polio 3C protease cleavage site. This construct is used to 
screen for compounds which inhibit polio 3C protease using procedures similar to those described 
previously for HIV protease. 

30 

BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 depicts the HIV protease constructs pPDB16 and pMae2. Construct pPDB16 encodes the 18 kd 

as protease precursor under the control of the T7 promoter. The Bglll-Dral fragment from XBH5 (4), 
corresponding to nucleotides 1418-1899 of the HIV pol gene, is subcloned into the Ndel-BamHI sites of 
the previously described expression vector (19), using synthetic oligonucleotides. The upstream 
oligonucleotides begin with an Ndel restriction site, providing an in-frame translational start codon, and 
regenerate nucleotides 1406-1417. The downstream oligonucleotides regenerate nucleotides 1900-1906, 

40 followed by two in-frame stop codons and a BamHI restriction site. For clarity, base A of the initiation 
codon is defined as nucleotide (nt) 1. Amino acids +1 to +99 represent the mature 10 kd protease 
liberated by autodigestion at the indicated Phe-Pro bond (7). pMae2 encodes the mature 10 kd protease. 
Its construction is similar to PPDB16, except that the upstream oligonucleotides regenerate from 
nucleotide 1610 to the Maelll restriction site. 

45 Figure 2 depicts the expression of HIV protease in E. coli . Panel A: Protease synthesis in BL21 (DE3)- 
pLysS cells harboring pPDB16. Proteins are labelled~with~ 3s S methionine as described (19), followed by 
electrophoresis and autoradiography. Lane 1, no isopropyl-jS-D-thiogalactopyranoside ("IPTG"). Lane 2, 
1mM IPTG. The arrow denotes the mature 10 kd protease. Panel B: Protease synthesis in BL21(DE3) 
survivors harboring mutagenized pPDBl6. The numbers above each lane correspond to the clone 

so designations in Tables II and III. Molecular weight markers are shown on the left (68 kd - bovine serum 
albumin; 43 kd -ovalbumin; 29 kd - carbonic anhydrase; 18 kd -beta-lactoglobulin; 14 kd - lysozyme). 
Figure 3 depicts the killing of BL21 (DE3)pLysS upon induction of HIV protease synthesis with IPTG. 
Cells harboring plasmid encoding wild type (pPDB16) or mutant (pP48) protease precursor are grown to 
0.5 absorbance units (600 nm), and IPTG is added to the indicated concentrations. At the indicated 

55 times, an aliquot of each culture is removed, and dilutions are plated on LB plates containing kanamycin 
and chloramphenicol. After overnight incubation, colony forming units ( n CFU w ) per 100 ul of culture are 
determined. 

Figure 4 depicts the comparison of amounts of wild-type and mutant protease production. BL21(DE3)- 
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pLysS cells harboring the indicated plasmids are induced with 1mM IPTG, and proteins are labelled with 
35 S methionine as described (19). Whole cell extracts (lanes 1-5), and immunoprecipitates (lanes 6-10) 
from antiserum raised to amino acids 34-46 of the mature protease, are subjected to electrophoresis and 
autoradiography. Lanes 1 and 6, pPDB16. Lanes 2 and 7, pP48. Lanes 3 and 8, pMae2. Lanes 4 and 9, 
5 pP48Mae6. Lanes 5 and 10, pPol, encoding the polio polymerase (19). Lane M, molecular weight 
markers (68 kd - bovine serum albumin; 43 kd - ovalbumin; 29 kd -carbonic anhydrase; 18 kd - beta- 
lactoglobulin; 14 kd -lysozyme; 6 kd - bovine trypsin inhibitor). 

Figure 5, Panel A depicts the HIV protease plasmids p1 +1Q and p15-IQ which encode wild type and 
mutant HIV protease, respectively, under tac promoter control. E. coli strain MC1061 harboring these 

ro plasmids is used as a source of HIV protease for in vitro cleavage of 0-gaJactosidase. For monitoring 
cleavage in vivo , the 0-galactosidase/cleavage cassette genes are excised from the pZM plasmid series 
with Stul and BamHI, and ligated into the BamHI site of p1 +IQ or p15-IQ using Bam HI linkers, so that 
HIV protease and 0-galactosidase are co-expressed in the same E coli cell. Panel B depicts the 
insertion of the HIV protease cleavage site into 0-galactosidase. Oligonucleotides encoding the HIV 

75 protease cleavage site are inserted into the indicated restriction sites of the 0-gaIactosidase gene. The 
nucleotide (nt) and amino acid (aa) positions with 0-gaIactosidase are indicated. 
Figure 6, Panel A depicts the in vitro cleavage of gag polyprotein by HIV protease. Gag polyprotein is 
incubated with buffer (lane 1), or with wild type (lane 2) or mutant (lane 3) HIV protease, followed by 
electrophoresis and autoradiography. Molecular weight markers are shown on the left (67 kd -bovine 

20 serum albumin; 42 kd - ovalbumin; 28 kd -carbonic anhydrase; 18 kd • beta-lactoglobulin). Panel B 
depicts the in vitro cleavage of /3-galactosidase/cleavage cassette proteins by HIV protease. Extracts of 
MC1061 cells harboring HIV protease plasmid are incubated with extracts harboring 0-galactosidase. 
followed by Western blotting with anti-0-galactosidase. Lane 1, MC1061 (no plasmid). Lane 2, wild type 
HIV protease extract Lane 3, mutant HIV protease extract. MC1061/pZM1 extract (lane 4) is incubated 

25 with wild type (lane 5) or mutant (lane 6) HIV protease. MC1061/pZM-Sau extract from two independent 
clones (lanes 7 and 10) is incubated with wild type (lanes 8 and 11) or mutant (lanes 9 and 12) HIV 
protease. The upper arrow on the right indicates l-galactosidase; the lower arrow on the right indicates 
the cleavage product from pZM-Sau. Molecular weight markers are shown on the left (103 kd - 
phosphorylase b; 67 kd - bovine serum albumin; 42 kd - ovalbumin; 28 kd - carbonic anhydrase). 

30 Figure 7 depicts the inhibition of cleavage of 0-galactosidase by Pepstatin A. The 0- 
galactosidase/cleavage cassette protein from MC1061 cells harboring pZM-Sau is incubated with HIV 
protease in the presence or absence of Pepstatin A, followed by Western blotting with anti-0-galac- 
tosidase. Lane 1, no protease. Lane 2, with protease added. Pepstatin A dissolved in dimethylsulfoxide is 
added to the incubation at concentrations of 0, 0.2, 1 , 6, 25 and 50 ug/ml (lanes 3-8). The upper arrow 

35 on the right indicates 0-galactosidase; the lower arrow on the right indicates the cleavage product from 
pZM-Sau. The molecular weight markers shown on the left are identical to those in Figure 6, Panel B. 
Rgure 8 depicts the in vivo cleavage of £-galactosidase/cleavage cassette proteins by HIV protease. 
Panel A: MC1061 ceils "harboring p1+ IQSau (lane 1), p1+IQ-CIaA (lane 2), p1+IQ-ClaB (lane 3), 
p1 +IQ-Bss (lane 4), pi +IQ-Eco (lane 5), p1 +IQ (lane 6), or without plasmid (lane 7), are grown to mid- 
40 log phase, and IPTG is added to 1 mM. After 30 minutes, ceils are pelleted and resuspended in sample 
buffer, followed by electrophoresis and Western analysis with anti-0-galactosidase. Panel B: MC1061 
cells without plasmid (lane 1) or harboring p1 + IQSau (lane 2) or p15-IQSau (lane 3) are treated as in 
panel A. In each panel, the upper arrow on the right indicates 0-galactosidase; the lower arrow on the 
right indicates the cleavage product from p1 + IQSau. In each panel, the molecular weight markers are 

45 identical to those in Rgure 6, panel B. 

Rgure 9 is a photograph which depicts the results of an in vivo cleavage assay. Samples of E. coli strain 
MC1061 harboring various plasmids are spotted onto plates containing the color indicator X-gal. The 
plasmids are pZM-1 (a positive control for 0-gaIactosidase activity), pZM-2 (a negative control for £- 
galactosidase activity), pZM-Sau, p1+ IQSau (labelled 1+IQB-Sau) and p15-IQSau (labelled p15-IQB- 

50 Sau). In the cleavage assay with wild-type HIV protease, constructs which cleave ^-galactosidase 
produce a white color, while constructs which do not cleave /9-galactosidase produce a blue color. 
Plasmid pZM-1 is constructed as described below. Plasmid pZM-2 encodes mutant, inactive 0- 
galactosidase, and so produces white colonies. Plasmid pZM-2 is constructed by digesting plasmid 
pZM-1 with EcoRI and BamHI to delete the DNA sequence coding for the 18 amino acids at the carboxy- 

55 terminal end of /3-galactosidase. As expected, pZM-1 produces a blue color, as do pZMSau and p15- 
IQSau. Plasmid pZM-2, as a negative control, produces a white color; 0-galactosidase from p1 + IQSau is 
cleaved by HIV protease and produces a white color. 

Rgure 10 depicts the cleavage of polio protease cleavage cassette by polio protease in vitro . Extracts of 
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0-galactosidase containing either the polio protease cieavage site (in MC1061/pZM3CSau) or the HIV 
protease cleavage site On MCl061/pZM-Sau) are incubated with either polio or HIV protease, followed by 
Western analysis with anti-0-galactosidase. Lane 1, MC1061 without plasmid. Lane 2, polio 3C protease. 
Lane 3, HIV protease. Lane 4, MC1061/pZM3CSau unincubated. Lane 5, MC1061/pZM3CSau incubated 
with polio 3C protease. Lane 6, MC1061/pZM3CSau incubated with HIV protease. Lane 7. MC1061/pZM- 
Sau unincubated. Lane 8, MC1061/pZM-Sau incubated with polio 3C protease. Lane 9, MC1061/pZM- 
Sau incubated with HIV protease. Lane 10, MCl061/pZM1 unincubated. Lane 11, MC1061/pZMl 
incubated with polio 3C protease. Lane 12, MC1061/pZM1 incubated with HIV protease. The upper arrow 
on the right indicates 0-galactosidase; the lower arrow on the right indicates the cleavage product from 
the Sau constructs. Molecular weight markers are shown on the left (103 kd - phosphorylase b; 67 kd - 
bovine serum albumin; 42 kd - ovalbumin. 



The present invention is directed to two screening methods for inhibitors of proteases such HIV 
protease, renin, trypsin, eiastase and polio protease. Each screening method will first be described with 
respect to HIV protease and then for polio protease. The methods may also be adapted for use with other 
proteases in accordance with the principles of the invention. 

The first screening . method for inhibitors of HIV protease is directed to the construction of a plasmid 
which expresses HIV protease that is toxic to an E. coli strain. This toxicity, in turn, is useful to screen for 
agents which inhibit HIV protease. 

To study the HIV protease, the BgllhDral fragment from XBH5 (4), corresponding to nucleotides 1641- 
2122 of the HIV pol gene encoding the protease, is subcloned into a previously-prepared expression vector 
(pT7K) containing a phage T7 promoter (19), creating plasmid pPDB16 (Rg. 1). The construction of 
pPDB16 is carried out in E. coli strain DH5a (Bethesda Research Laboratories, Gaithersburg. Maryland), 
prior to transformation into the expression strain, E. coli BL21(DE3)pLysS. It is necessary to carry out this 
construction in a strain such as DH5«, which doei~noTcontain T7 RNA polymerase and, therefore, cannot 
express HIV protease, since expression of HIV protease will be shown to be toxic to the expression strain. 

An E. coli strain DH5a harboring plasmid pPDB16 has been deposited with the American Type Culture 
Collection and has been assigned ATCC accession number 68005. Plasmid pPDB16 may be obtained 
through conventional techniques (20) by growing this strain in culture, breaking open ceils, centrifuging and 
isolating the plasmid from the supernatant 

The expression strain E. coll BL21(DE3)pLysS contains a chromosomal copy of T7 RNA polymerase 
under the control of an inducible lacUVS promoter, and a chloramphenicol-resistant plasmid encoding T7 
lysozyme which inhibits transcription by T7 RNA polymerase in the absence of induction (21,22). Cells 
harboring pPDB16 are induced with the inducing agent IPTG in the presence of methionine, and protein 
synthesis is examined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis ("SDS-PAGE") using 
a 15 percent polyacryfamide gel (comprising 15 ml of 30% acrylamide, 1.3 ml of 2% bisacrylamide. 11.2 ml 
of 1M TrisCI (pH 8.7), 0.15 ml 20% SDS. 2.35 ml of H 2 0, 25 ul N.N.N'.N'-tetramethylethylenedlamine and 
100 ul 10% ammonium persulfate (20)), and autoradiography. The HIV DNA fragment in pPDB16 is 
expected to encode a primary translation product of 18 kd. Instead, as shown by SDS-PAGE, a prominent 
band of 10 kd is observed upon induction (Rg. 2A, lane 2). This observation is consistent with efficient 
autodigestion of the 18kd precursor into the mature 10 kd protease (the biologically active form) at the 
phenylalanine-proline bond (Rg. 1), and is in agreement with results reported previously (5,7). 

The 10 kd HIV protease band detected by autoradiography is not visible by Coomassie blue staining, 
suggesting that only low levels of this protein are made. It has been previously determined that the absolute 
level of expression of a cloned gene under T7 promoter control is lower in BL21 (DE3)pLysS than in the 
corresponding strain lacking lysozyme (BL21(DE3) (21)), due to the inactivation of T7 RNA polymerase by 
lysozyme (22). However, genes under T7 promoter control which are toxic to E. coii will not produce stable 
transformants in BL21(DE3), which lacks the lysozyme plasmid, pLysS (21). 

In an attempt to increase the level of expression of the HIV protease, the use of the expression strain 
BL21(DE3) is investigated by applicants. The results of two transformation experiments are shown in Table 
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TABLE I 





No. of transformants 


Plasmid 


BL21(DE3) 


BL21(DE3)pLysS 


A. 


pPDB16 


0 


400 


pPoll 


400 


400 


pPo!2 


350 


300 


B. 


pMae2 


0 


960 


p48Mae6 


330 


840 


pPoll 


600 


2400 



Table I. Transformation of BL21(DE3) strains by HIV protease constructs. BL21(DE3) cells containing or 
lacking the pLysS plasmid are made competent by the standard calcium chloride procedure (20). Frfty ng of 
protease or control plasmid are used for each transformation. Ptasmids pPDB16 and pMae2 are as 
previously described; plasmid pP48Mae6 encodes HIV protease with a point mutation at amino acid 29. 
Plasmids pPoll and pPol2 encode the polio polymerase (19). 

Whereas several plasmids as controls transform strains BL21(DE3) and BL21(DE3)pLysS equally well, 
plasmids pPDB16 and pMae2 encoding HIV protease do not yield transformants in BL21(DE3). These data 
strongly suggest that expression from pPDB16 or pMae2 is toxic to E. coH BL21(DE3). This toxicity, in turn, 
suggests a method of identifying agents which inhibit HIV protease. 

The apparent toxicity of HIV protease could be due to its inherent enzymatic activity, or to some other 
effect of this protein. This question is answered by isolating mutants of the HIV protease by screening for 
viable transformants of BL21(DE3). To introduce mutations into the HIV protease, plasmid pPDB16 is first 
transformed into E. coii strain LE30 mutD (Cold Spring Harbor Laboratory, Cold Spring Harbor, New York). 
This strain has a lO^IO 4 higher frequency of spontaneous mutation compared to wild-type E. coll (23). 
Plasmid is prepared from 50,000 pooled LE30 mutD transformants. and this collective plasmidTs~uIed for 
the transformation of BL21(DE3). Approximately 150 colonies are obtained. These clones are examined for 
HIV protease production by labelling of cells with 35 S methionine in the presence of IPTG, followed by SDS- 
PAGE using the system described previously and autoradiography. As shown in Fig. 2B, these putative 
mutants do not produce the 10 kd protein observed for the wild-type clone. Instead, they produce protein of 
either 18 kd (lanes M3, M4, M5). or protein smaller than 18 kd, but not 10 kd, (lanes M2, M6, M7). These 
proteins are only observed upon induction with IPTG, indicating that they are under the control of the T7 
promoter, in addition, when plasmid isolated from these putative mutants is added to a DNA-dependent 
transcription-translation system from E. coll , protein of the same size as those seen in labelled cells is 
observed only in the presence of T7 RNA polymerase. 

To determine if a specific genetic lesion could be correlated with the altered size of the inducible 
protein, the entire 500 bp HIV DNA insert of each putative mutant is sequenced. The first group of 18 BL21- 
(DE3) survivors is comprised of eight clones producing 18 kd protein (M3, M4, M5, M8. M9, M12, P2, and 
P5), seven clones producing protein smaller than 18 kd (M2. M6. M7, M10, M11, P49 and P74). and three 
clones which do not produce protein upon induction with IPTG (M1 , P3, and P26). 

The sequencing results are presented in Tables II and III. Table II presents nonsense mutants of HIV 
protease: 
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TABLE II 



Nonsense Mutants Of HIV Protease 


Clone 


Mutation 


P26 


CAG - UAQ at nt 60 


M7 


CAA - UAA at nt 225 


P49 


CGA - UGA at nt 228 


M10. M11 


G insertion at nt 259, UGA at nt 293 


M2 


G deletion at nt 352, UAA at nt 373 


P74 


G insertion at nt 353. UGA at nt 383 


M6, P52 


CAG - UAG at nt 387 



75 

Table II. The entire 500 bp protease insert of pPDB16 from BL21(DE3) survivors is subjected to DNA 
sequence analysis, and a premature stop codon is identified. Clone P26 encodes a truncated protease of 2 
kd which is not detected upon labelling of E. coti proteins, probably due to its small size. Each other clone 
encodes a protein smaller than 18 kd whichls detected by labelling (see Fig. 2B, lanes M2, MB, M7). 

20 Each clone which produces protein smaller than 18 kd contains a premature translation termination 
codon (Table II). The premature termination codons of clones M8, M7, and P49, result from a C -> U 
transition. For clones M2, M10. M11, and P74, the insertion or deletion of a G residue within a stretch of 5 
or 6 G's results in a translational frameshift which should lead to termination as indicated (Table II). In each 
case, whether a transition or frameshift occurs, the size of the protease predicted from the DNA sequence 

25 is in excellent agreement with the size of the protein observed upon induction (Fig. 2B). 

Clone P26, which apparently does not produce labelled protein upon induction with IPTG, is shown to 
contain a premature translation termination codon at nucleotide 61 (Table II). The truncated protein 
expected upon induction of this construct would be approximately 2 kd, and is too small to be resolved by 
the gel system used, which is based upon that of Plotch et al. (19). Clones M1 and P3 are also subjected to 

30 DNA sequence analysis, and are found to encode wild-type protease. Since whole plasmid is subjected to 
mutagenesis in strain LE30mutD, these clones may contain mutations outside of the HIV protease region 
that would prevent transcription or translation from genes under T7 promoter control, such as mutations in 
the promoter region itself or in the ribosome binding site (21). 

For each clone which produces 18 kd protein, a single nucleotide mutation leading to a single amino 

35 acid substitution is detected by DNA sequencing (Table III). After these initial results with the first group of 
18 clones are obtained, only those clones which produce 18 kd protein upon induction are examined in 
detail. The specific amino add change determined for each mutant is summarized in Table III: 



7 



EP 0 421 109 A2 

TABLE III 
Missense Mutant s Of HIV Protease 

Clone Position Mutation 





Nucleo- 
tide 


amino 
acid 








P57 


196 


-4 


UCC 


— > 


uuc 








Ser 




Phe 


P46 


211 


2 


CAG 


— > 


CGG 








Gin 




Arg 


P12, P37 


214 


3 


AUC 


— > 


AAC 








He 




Asn 


P32 


250 


15 


AUA 


— > 


ACA 








He 




Thr 


P29 


270 


22 


GCU 


— > 


ACU 








Ala 




Thr 


M12 


285 


27 


GGA 


■ — > 


AGA 








Gly 




Arg 


P48 


292 


29 


GAU 


— > 


GGU 



35 



SO 



55 



8 
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P15 297 31 

PI, P5 # P80 319 38 

P66 339 45 

M3, M5 346 47 

, 5 P31, P44 355 50 

M9 381 59 

P24 385 60 

P13, P47, P68 403 66 

M4, P73 424 73 

30 P45 f P79 427 74 

M8 433 7 6 



20 



25 



35 



40 



45 



SO 



P2, P8 436 77 

P22 442 79 

P38 460 85 

P9 463 86 

P50 465 87 

P27 484 93 



Asp 




Gly 


ACA 


— > 


GCA 


Thr 




Ala 


UUG 


— > 


UCG 


Leu 




Ser 


AAA 


— > 


GAA 


Lys 




Glu 


AUA 


— > 


ACA 


lie 




Thir 


AUU 


— > 


ACU 


lie 






UAU 


— — > 


CAU 


Tvr 
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55 Table III. Missense mutants of HIV protease. The entire 500 bp protease insert of pPDB16 from BL21(DE3) 
survivors producing 18 kd protease precursor (see text and Fig. 2B) is subjected to DNA sequence analysis, 
and a single point mutation is identified in each clone. Nucleotide and amino acid numbering are described 
in Fig. t. 
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In all, 24 different point mutations are identified. Six of these mutations, at amino acids lie 3, lie 47, lie 
50, Gly 73, Thr 74 and Val 77, are identified in two independent clones. The mutations at Leu 38 and lie 66 
are each identified in three independent clones. Because mutations are reisolated without obtaining some 
other known mutations (e.g., mutation at Asp 25) (4,5), it is possible that sister ptasmids from the 

5 mutagenized plasmid pool are being detected. 

Some of these mutations can be grouped according to the proposed function of the amino acid within 
HIV protease. The triplet Asp, Thr, and Gly, is conserved throughout the family of aspartyi proteases (8) and 
helps to locate the active site of HIV protease at residues 25, 26, and 27. Mutations at or near these 
residues should render the protease enzymatically inactive by disruption of the catalytic site. The three- 

70 dimensional structure of HIV protease has been recently solved (10), and it appears that applicants' 
mutations at Ala 22, Gly 27, Asp 29, Thr 31, lie 85, Gly 86, and Arg 87, are within the cleft of the active site. 
Applicants also find mutations in amino acids which are proposed to participate in hydrogen bonding within 
the protease, including Ala 22, Lys 45, He 47, He 66, Gly 73, Val 77, and He 85. Finally, the HIV protease is 
probably a dimer based on X-ray crystal structure (10,24), and applicants' mutations at Gly 27, He 50, and 

75 lie 93 should interfere with dimer formation. 

Loeb et al. (25) performed extensive site-directed mutagenesis between amino acids 21 and 46 of the 
HIV protease, and then determined processing activity. Most missense mutations abolished processing 
within the region, which is highly conserved among retroviral proteases. These mutations include Asp 29 •> 
Gly, and Thr 31 -> Ala, in agreement with applicants' results. However, in contrast to the approach of Loeb 

20 et al. f applicants' method of mutant isolation appears to select only for non-functional protease. Consistent 
with this are 31 mutations in amino acids 21-46 which Loeb et al. determined had little or no effect on 
processing. None of these mutations comes through applicants' screen. Applicants' results also indicate that 
point mutations throughout the protease, outside of the conserved regions, can destroy the activity of the 
enzyme (Table III). 

25 Kraussiich et al. (26) have shown that deletion of the carboxy-terminal 17 amino acids from the 18 kd 
HIV protease precursor prevents autodigestion which would liberate the mature 10 kd protease and 
furthermore, that the precursor is unable to cleave a gag substrate in trans . Similar results were obtained 
by Hansen et al. (27). Four of applicants' point mutations, at He 85, Gly 86, Arg 87, and He 93, are within 
these 17 carboxy-terminal amino acids. 

30 Applicants' results indicate that the mutant 18kd protease precursor is incapable of autodigestion, and is 
apparently not toxic to BL21(DE3). Without being bound by the ensuing discussion, applicants propose that 
the 10 kd mature protease exerts its toxic effect by digesting an unidentified essential E. coli protein, and 
that the 18 kd precursor is unable to carry out this digestion. Failure to generate the mature 10 kd protease 
could result from mutating an amino acid required for catalytic activity, or from disruption of the scissile 

35 Phe-Pro bond (located at amino acids -1 to +1 of the mature protease), such that autodigestion of the 18 
kd precursor could not occur. Applicants propose that the mutations at Ser -4, Gin 2, and He 3 (Table II), 
render the Phe-Pro bond uncleavable. Loeb et a). (25) used site-directed mutagenesis of the pol polyprotein 
(which encodes the protease, reverse transcriptase, and endonuclease) to introduce substitutions at the 
upstream Phe-Pro bond. One class of mutants is defective at autodigestion, but can still cleave at the non- 
40 mutated reverse transcriptase and endonuclease borders. This processing results either from a small 
amount of undetected mature protease, or from the precursor which retained activity. Another class of 
mutants which fails to liberate the mature protease is also inefficient at cleavage to generate the reverse 
transcriptase and endonuclease. It is possible that applicants' method of mutant detection, the requirement 
for stable transformation of BL21(DE3), selects for 18 kd precursors which are enzymatically inactive. 

45 To directly examine the effect of mature protease on the viability of BL21(DE3), applicants construct 
plasmids which encode only the 10 kd protease, bypassing the need for autodigestion of precursor (Fig. 1). 
As shown in Table IB, the wild-type HIV 10 kd protease plasmid (pMae2) does not transform strain BL21- 
(DE3) successfully, but is able to transform BL21 (DE3)pLysS. Plasmid p48Mae6, encoding protease with a 
point mutation at amino acid 29 (Table III), is able to transform both E. coli strains. 

so Further evidence that the HIV protease is toxic to BL21 (DE3)"Ts"provided by an examination of the 
survival of cultures in the presence of IPTG. In this experiment, cultures of BL21(DE3)pLysS cells harboring 
wild-type protease (pPDB16) or mutant protease (pP48) are grown to mid-log phase, and protease synthesis 
is induced with IPTG. The number of viable cells is determined by plating aliquots of cultures for colony- 
forming units. After 30 minutes of exposure to 1.0 mM or 0.1 mM IPTG, cells harboring wild type protease 

55 experience approximately four logs of killing compared to untreated cells (Fig. 3). In contrast, cells 
harboring the mutant protease are unaffected by the addition of IPTG. These data indicate that HIV protease 
is extremely toxic to BL21(DE3). Similarly, cells harboring plasmid encoding wild-type 10 kd protease 
(pMae2) are significantly more sensitive to IPTG than cells harboring plasmid encoding the corresponding 
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mutant protease (p48Mae6). 

An E. coli strain BL21(DE3) harboring plasmid pP48 has been deposited with the American Type 
Culture Collection and has been assigned ATCC accession number 68004. This plasmid-harboring strain 
may be subjected to one of the following methods to delete pP48 so as to leave E. coli strain BL21(DE3) 
5 (28.29): 

1) Curing with agents which preferentially inhibit plasmid replication, such as acridine orange, acriftavin, 
quinacrine, or ethidium bromide: For example, BL21(DE3VpP48 is grown in liquid culture overnight. The 
medium is 10g peptone and 10g yeast extract per liter, adjusted to pH 7.6 with NaOH. Then, 10 4 cells 
per ml are inoculated into a series of tubes containing 10-100 ug/ml (10 ug/ml increments) of acridine 

w orange. These cultures are incubated overnight at 37* in the dark. Cells are streaked onto LB plates 
from the culture containing the highest concentration of acridine orange which showed growth, individual 
colonies are tested for kanamycin resistance. Colonies which are kanamycin sensitive should have lost 
pP48, so that only E. coli strain BL21(DE3) remains. 

2) Spontaneous curingTFor example, BL21(DE3)/pP48 is grown to stationary phase several times in LB 
75 broth lacking kanamycin. Cells are streaked onto LB plates. Individual colonies are tested for kanamycin 

resistance. Colonies which are kanamycin sensitive should have lost pP48, so that only E. coli strain 
BL21(DE3) remains. 

3) Curing with agents which inhibit DNA gyrase, such as coumermycin or novobiocin: This method is 
identical to acridine curing, except curing agents are used at 1 to 7 micrograms per ml. 

20 4) Curing with an agent which inhibits RNA polymerase, such as rifampicin: This method is identical to 
acridine curing, except rifampicin is used at 5 to 10 micrograms per ml. 

The amount of protein produced upon IPTG induction by cells harboring plasmids encoding wild-type or 
mutant versions of the 18 kd or 10 kd protease is examined by 35 S methionine labelling and im- 
munoprecipitation, followed by gel electrophoresis and autoradiography (Fig. 4). It is estimated that mutants 

25 P48 and P48Mae6. which encode the 18kd precursor and the 10 kd protease respectively (lanes 2 and 4), 
produce as much as ten-fold more protease than the corresponding wild type clones PDB16 (lane 1) and 
Mae2 (lane 3). The decreased amount of the wild-type protease produced compared to the mutant is 
probably due to the killing of cells by wild-type protease upon induction with IPTG (Fig. 3). Unexpectedly, 
however, the mutants still produce very low levels of the protease or precursor, despite their lack of toxicity. 

30 The mutant proteins cannot be detected by Coomassie blue staining of the corresponding gels. Since 
different genes are expressed at different efficiencies in the T7 system (21), there may be a barrier to 
overproducing HIV protease, even when its toxicity is apparently alleviated. 

The HIV protease has been successfully expressed in a variety of E. coli K12 strains (5,6.7, 9,27), 
although at least in HB101, inhibition of growth upon induction of protease synthesis has been reported 

35 (30). The HIV protease may be especially toxic to BL21(DE3), which is an E. coli B strain. The toxicity of 
HIV protease to E. coli BL21(DE3)21 appears to indicate that the protease is functioning properly. In 
contrast to the resuits~7eported in this application, Krausslich et al. (15) report expression of HIV protease in 
strain BL21(DE3) lacking the iysozyme plasmid. Differences between the Krausslich et al. protease 
construct and that of the present invention may account for the consistent observation that BL21(DE3) 

40 cannot be transformed by pPDB16. The amino terminus of applicants' 18 kd protease is codon 1 of the pol 
gene preceded by a single methionine residue, whereas that of Krausslich et al. is codon 5 of the pol gene 
preceded by 12 amino acids from phage T7 gene 10 (26). In addition, the novel protease construct of this 
invention is on a plasmid encoding resistance to kanamycin. not ampicillin. However, the protease is also 
toxic to BL21(DE3) when introduced on a vector specifying ampicillin resistance, indicating that the putative 

45 target of the HIV protease is probably not the phosphotransferase which confers kanamycin resistance. 

Recent studies with peptide substrates have identified minimal sequence requirements for cleavage by 
the HiV protease, and as few as seven amino acids are sufficient (11,12,16). Thus, the bacterial protein 
which is the putative target of the protease need bear little homology to the natural gag and pol substrates. 
The toxicity of the HIV protease to E. coli strain BL21(DE3) provides a method to isolate mutants of this 

so important virai enzyme, and, more importantly, the potential to screen for protease inhibitors useful for 
therapy of HIV infection. 

Applicants have demonstrated that HIV protease plasmids pPDB16 and pMae2 are unable to success- 
fully transform E. coli strain BL21(DE3) (Table I). As a screen, these transformations are attempted in the 
presence of potential HIV protease inhibitors. Successful transformations serve to identify inhibitors of the 
55 protease. In another experiment, applicants also demonstrate that phage encoding HiV protease is unable to 
transfect E. coli strain BL21(DE3)F + . These transfections may also be attempted in the presence of 
potential pTotelse inhibitors. 

Another example of the first method for screening involves the use of BL21(DE3)pLysS transformed 
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with HIV protease plasmids pPDB16 or pMae<L The induction of protease synthesis by IPTG kills these 
cells, as demonstrated in Figure 4. Potential protease inhibitors might prevent the death of induced cells. 

The foregoing experiments with the first HIV protease screening method may be adapted to identify 
inhibitors of polio protease. The starting material for this embodiment of the invention is poliovirus type 1 
5 (Mahoney strain) which has been deposited previously with the American Type Culture Collection and has 
been assigned ATCC accession number ATCC VR-59. This poliovirus strain is used to generate the polio 
protease 3C through a series of steps described below. 

In summary, poliovirus strain 1 is first used to generate the plasmid pT7pv1-5 (31). The plasmid 
contains the entire cDNA sequence for poliovirus strain 1 under the control of the T7 promoter described 
io previously. Polio protease 3C is then subcloned from this plasmid. 

Specifically, in a first step, cDNA is prepared from poliovirus strain 1 and is subcloned into the T7 
vector to construct plasmid pT7pv1*5 by standard techniques. The polio protease gene (nucleotides 5438- 
5986) is excised from plasmid pT7pv1-5 as an Asul restriction fragment (nucleotides 5435-5953). Synthetic 
oligonucleotides are used to recreate the upstream and downstream coding sequences, and to provide 
75 translational start and stop codons, as well as the Ndei and BamHI restriction sites previously described 
with regard to Figure 1. Following this procedure yields a plasmid bearing polio protease 3C under the 
control of the T7 promoter. 

In experiments analogous to those with HIV protease, applicants' data suggest that polio protease 3C is 
also toxic to E. coli strain BL21 (DE3). As shown in Table IV below, the plasmid bearing polio protease 3C is 
20 unable to transform BL21(OE3) successfully, but does transform BL21 (DE3)pLysS. The control plasmid 
pPoh (encoding polio polymerase as described in Table I) transforms both strains with approximately equal 
efficiency: 

TABLE IV 

25 



No. of transformants 


Plasmid 


BL21(DE3) 


BL21(DE3)pLysS 


pPolio Protease 3C 
pPoh 


0 

300 


150 
500 



These data indicate that polio protease 3C, like HIV protease, is toxic to this E. coli strain, BL21(DE3), when 
35 placed under the control of the T7 promoter. Therefore, a screening method for polio protease inhibitors can 

readily be devised. In such a screen, a compound to be tested is shown to be an inhibitor of polio protease 

by preventing killing of the bacterial strain BL21(OE3) and permitting successful transformations of that 

strain by a plasmid coding for the polio protease 3C. 

In turn, the results with HIV protease and polio protease 3C provide the basis for a generic screen for 
40 protease inhibitors. Of particular interest are compounds which inhibit other proteases which have been 

implicated in human diseases, such as human renin, trypsin and elastase. 

The particular target protease is placed in a plasmid under the control of the T7 promoter. The system 

is next shown to be toxic to E. coH strain BL21(DE3), by failing to transform that strain even as strain BL21- 

(DE3)pLysS is transformed. Potential protease inhibitor compounds are then screened. Compounds which 
45 have inhibitory activity will be those which block the toxicity of the protease to BL21(DE3) and thereby 

permit transformations of that strain. 

The second screening method for inhibitors of protease enzymes is directed to the construction of 

fusion proteins comprising a reporter protein (meaning an easily assayed protein) into which a protease 

cleavage site has been introduced. The cleavage site corresponds to a portion of a natural substrate of the 
so protease of interest. 

The reporter protein chosen is capable of being assayed for a change in color, light or activity when 
combined with a protease enzyme. The introduction of the cleavage site to create a protease "cleavage 
cassette" does not destroy the activity of the reporter protein. Thus, the fusion protein can be used in an 
assay system to screen for inhibitors of proteases. 
55 When the protease enzyme is added to the fusion protein containing the protease cleavage site, the 
protease cleaves at the protease site, thereby inactivating the reporter protein. This loss of activity is 
monitored by means of a color change, light change or by directly monitoring the loss of activity. A 
compound to be screened is included in the assay system and is identified as an inhibitor of the protease if 
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the reporter protein is not cleaved and remains active when monitored by any of the methods described 
above. 

Examples of reporter proteins that are utilized in constructing the fusion proteins and which are assayed 
by monitoring a color change with a color indicator compound include £-galactosidase t alkaline 

5 phosphatase, ^-glucuronidase and acetyltransferase. This system may be adapted for proteins specifying 
antibiotic resistance, such as ^-lactamase. In a preferred embodiment of this invention, 0-galactosidase is 
used as the reporter protein. 

The 0-galacto$idase enzyme encoded by the lacZ gene of E. coli is responsible for the conversion of 
lactose into glucose and galactose, allowing utilization of lactose as a carbon source (32). Because £- 

io galactosidase will also catalyze the hydrolysis of several other substrates which are converted into colored 
compounds, ^-galactosidase has proven an invaluable toot to monitor a wide range of biological processes. 
For example, 0-galactosrdase fusions have been used to study transcriptional and translations regulation 
(33, and references therein). Fusion of heterologous protein to jS-galactosidase can facilitate the purification 
of that protein (34-37). In addition, interruption of the amino terminus of ^-galactosidase is the basis of 

is identifying desired clones when using the pUC plasmid series (38). 

The demonstration in this invention that foreign protein sequences can be inserted into specific regions 
of ^-galactosidase. resulting in both the retention of 0-galactosidase activity and the accessibility of the 
foreign sequences to proteases, is novel. In the specific case of the protease cleavage cassette, the fact 
that 0-galactosidase can now be cleaved, and inactivated, by the cognate protease provides a simple 

20 means to monitor protease activity, and to search for inhibitors of these important enzymes. 

Examples of suitable color indicator compounds include X-gaf and Fast Blue RR. A protein such as 0- 
gaiactosidase will react with X-gal to produce a blue colon when the protein is inactivated, no color change 
(white) is observed. A similar reaction of the protein with Fast Blue RR produces a red color; upon 
inactivation, no color change (white) is observed. 

25 Alternatively, the protein is chosen such that it catalyzes the tight-emitting chemical reaction of a protein 
derived from a bioluminescent or chemiluminescent organism. However, when the protein is cleaved by a 
protease enzyme, the reaction cannot occur, such that no light is emitted. The light may be observed with 
the naked eye, measured spectrophotometrically or recorded on X-ray film. In a preferred embodiment the 
reporter protein used is luciferase. 

30 Additionally, the inactivation by the protease may be monitored by an in vitro system which directly 
measures the activity of the protein in a reaction with a substrate. 

In all of the variants of the second embodiment of this invention, the fusion protein is expressed in a 
suitable expression system. The host is a bacterium, yeast, insect cell, mammalian cell line, or any other 
suitable host expression system. In a preferred embodiment, the host is a strain of E. coli . 

35 As illustrative of the second screening method of this invention, the constructionand use of an assay 
system for screening for inhibitors of HIV protease with jff-galactosidase and a color indicator will now be 
described. 

The initial step is to construct a plasmid containing the gene coding for 0-galactosidase. Plasmid pZMl 
is constructed as follows: E. coli strain MC1061 (araD139. A(ara,leu)7697, AlacX74 ( galU, galK, hsdR. strA 

40 is obtained from Pharmacia. Plasmid pZM1, containing the E. coli ^-galactosidase gene, is derived from 
pCH110 (Pharmacia). Plasmid pCH110 contains an EcoRI site which is deleted by replacing the smaller 
Pstl/BamHI fragment from pCH110 containing the ^-lactamase gene with the corresponding Pstl/BamHI 
fragment from a variant of pBR322 which lacks the EcoRI site normally present on that fragment. Next, the 
EcoRI/BamHt fragment containing the carboxy terminus of the 0-galactosidase gene is replaced with the 

45 oligonucleotide: 

5 1 MEKXZG^^ 3 f 

50 

which recreates the carboxy terminus of ^-galactosidase and places two stop codons at the end of the 
gene. This resulting plasmid is designated pZM1. 

The next step is to construct plasmids which encode wild type and mutant HIV protease, respectively, 
55 each under tac promoter control. These plasmids, designated p1+IQ and p15-IQ. are constructed as 
follows: The protease genes from pMae2 (wild type) or p48Mae6 (mutant), as described previously, are 
excised as Ndei/BamHI fragments, made blunt by Klenow enzyme, and ligated into the Sma site of 
pKK223-3 (Pharmacia), creating p1 + and p15-. The Sphl/AlwNI fragment from p1 + and pl5-, containing 
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the HIV protease gene, is then ligated to the Pvul/AlwNI fragment containing the laclQ gene from pGEX2T 
(Pharmacia), creating p1 +!Q and p15-IQ (Rg. 5A). To delete the BamHI restriction site in the polylinker of 
p1 + IQ and p15-!Q, these plasmids are partially digested with BamHI, made blunt with Klenow enzyme, and 
ligated. The resultant plasmids, designated p1+IQB and p15-IQB, contain a unique BaMHI site directly 
upstream of the tac promoter. 

Thereafter, an HIV protease cleavage site is inserted into 0-gaJactosidase by inserting an 
oligonucleotide into the l-galactosidase gene of plasmid pZM-1. Studies from several laboratories using 
synthetic peptides which span the different cleavage sites of the gag and gag/pot polyproteins, have 
determined the minimal amino acid sequence requirements for cleavagTby HIV protease (11,12,15,16). The 
p6/PR site is cleaved most efficiently in vitro , and as few as seven amino acids are sufficient for cleavage 
(15). Based on these minimal requirements, a series of oligonucleotides ("cleavage cassettes") are 
synthesized. These oligonucleotides correspond to the ten amino acids which span the p6/PR junction, and 
contain the appropriate cohesive ends for insertion, in frame, into six different locations of the p- 
gaiactosidase gene of plasmid pZM1 (Rg. 5B). 

The decapeptide Val-Ser-Phe-Asn-Phe-Pro-Gln-IIe-Thr-Leu corresponds to the p6/PR cleavage site of 
the HIV gag/pol polyprotein. Six pairs of oligonucleotides ("cleavage cassettes") encoding this decapeptide 
are synthesized. The oligonucleotides have in common the DNA sequence shown below containing a Hindlll 
restriction site (underlined), which encodes the decapeptide in each oligonucleotide pain 

Val Ser fhe Asn Itoe Pro Gin He Thr Leu 
5 V G EV AGC TT T AAC TIC OCT GAG ATC ACT GIG 3' 
3 V CAT TOG AAA TIG AAG GGA CTC 1BG TGA GAC 5 f 

Each pair contains additional nucleotides specifying different cohesive ends designed for insertion, in 
the proper reading frame, into the unique BssH2, Clal, Dralll, EcoRI, Saul or Sstl restriction sites of the E 
coli j8-galactosrdase gene of pZM1 (Rg. 5B). The six different sites in ^-gaiactosidase are selected because 
they are unique restriction sites within pZM1, and the cleavage cassette can be inserted by a simplified 
cloning procedure described below. The complete DNA sequences of the six pairs of oligonucleotides are 
as follows (with the HIV cleavage site set off by brackets): 
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ECOl/2 (xRI cohesive ends) : 

5' AA TTE f CTA ASC TTT AAC TIC OCT CAS AFC ACT CTG] C 3 V 
3* A [CAT TOG AAA TIG AAG GGA GTC TAG TSA GAC] GTC AA 5* 

SST1/2 (xSstl cohesive ends) : 

5 f G FG IA ASC TT T AAC TTC OCT GAG ATC ACT CTG] GAG CT 3* 

3 1 TC GAC [GAT TOG AAA TKS AAG GGA GTC TPG TGA GAG] G 5 1 

CLA1/2 (xClal cohesive ends) 

5* C T GTk ASC TTT AAC TTC OCT Qfl ATO ACT CT ] 3» 

3* [ AT TOS AAA ITS AAG GGA GTC TAG TGA GAG] C 5' 

DRA1/2 (xDralll cohesive ends) 

5 9 G fG IA ASC TT T AAC TTC OCT CAS ATT AOG CT ] 3 1 

3 1 C GAC [CAT TOS AAA TIG AAG GGA GTC TAA TG ] 5 1 

SAU1/2 (xMstllSaul cohesive ends) 

5 1 T GAA TG IA AGC TTT AAC TTC OCT CAG ATT AOS CTG] AC 3' 
3 1 T [CAT TOG AAA TIG AAG GGA GTC TAA TGC GAC] TGA CT 5' 

BSS1/2 (xBssH2 cohesive ends) 

5' OG OGA fG IA AGC TTT AAC TTC OCT CAG AIT AOG CTG] A 3* 
3* T [GAT TOG AAA TTG AAG GGA GTC TAA TGC GAC] TGC GC 5' 

To simplify the cloning procedures only the cohesive ends of each oligonucleotide pair resemble the 
target restriction site, so that successful ligation of the oligonucleotide into the 0-galactosidase gene 
destroys that restriction site. Therefore, after ligation of the oligonucleotide into pZM1, the ligation mixture is 
digested with the target restriction enzyme to linearize any residual parent piasmid pZM1. After ethanol 
precipitation, the digested ligation mix is used for transformation of E. colli strain MC1061. Correct clones 
are identified by the presence of a new Hindlll site. This method results in a high proportion of correct 
clones and eliminates the need for phosphatase treatment of the vector and kinasing of the 
oligonucleotides. 

The resultant plasmids are designated pZM-Bss, pZM-CIa A, pZM-Dra, pZM-Eco, pZM-Sau, and pZM- 
Sst respectively. Also constructed is piasmid pZM-CIa B, which contains the cleavage cassette in the 
reverse orientation encoding the unrelated decapeptide Glu-Ser-Asp-Leu-Arg-Glu-Val-Lys-Ala-Tyr, which 
should not be recognized by HIV protease. In each case, insertion of the cleavage cassette within the &• 
galactosidase gene is confirmed by DNA sequencing. 
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To ascertain whether the desired 0-galactosidase activity is retained despite the insertion of a cleavage 
cassette, E. coli strain MC 1061 harboring each construct is plated onto X-gaf plates. In this assay, 
functional j3-g5ictosidase results in the formation of blue colonies, but lack of jS-gaJactosfdase activity 
produces white colonies. Using procedures described below, each construct is also tested for quantitative 
5 iS-galactosidase activity and for whether it is cleaved by HIV protease in vitro and in vivo . Table V sets 
forth the results of this assay: 

TABLE V 



10 


Beta-Qatactosidase Activity of Cleavage Cassette Constructs, and 






Susceptibility to Cleavage by HIV Protease 






Plasmid 


Colony color on 


beta-galactosidase 


Cleavage by HIV 


15 




X-gal plates 


activity (units) 1 


protease 2 








in vitro 


in vivo 




pZM-Bss 


Blue 


5 


no 


no 




pZM-ClaA 


Blue 


10 


no 


no 




pZM-CIa B 


Blue 


163 


no 


no 


20 


pZM-Dra 


White 


0 


no 


n.d. 3 




pZM-Eco 


Blue 


2 


no 


no 




pZM-Sau 


Blue 


63 


yes 


yes 




pZM-Sst 


White 


0 


no 


n.d. 




pZM1 


Blue 


1179 


no 


n.d. 


25 


none 


White 


0 







1 beta-galactosidase activity is determined by hydrolysis of ONPG 

2 cleavage of beta-galactosidase by HIV protease is determined by Western analysis 
(see Figures 6 and 8) 

3 n.d. = not done 



Insertion of the cleavage cassette at the Saul, Clal, BssH2 or EcoRI sites results in blue colonies, 
indicating that the enzymatic activity of 0-galactosidase is retained, insertion of the cleavage cassette in the 

35 opposite orientation in the Clal site such that it encodes a decapeptide unrelated to the HIV protease 
cleavage site (Table V, pZM-CIa B) also results in theformation of blue colonies. These data indicate that 
the 0-galactosidase protein can tolerate insertions at several locations and still retain detectable enzymatic 
activity. Conversely, insertion at either Dralll or Sstl destroys £-galactosidase activity, since colonies 
harboring these constructs are white. 

40 The intensity of blue color in strains harboring the pZM plasmids Bss, Cla A, Cla B, Eco and Sau 
exhibits considerable variation, and in each case is less intense than that conferred by the parent plasmid 
pZM1. £-gaiactosidase activity is quantitated by hydrolysis of O-nitrophenyl-0-D-galactoside (ONPG) (39). 
Plates contain 5-bromo-4-chioro-3-indolyl-^-D-galactoside (X-gal) at 40 ug/ml. The assays use a modifica- 
tion of those described by Miller (40). Ceris are grown in Luria broth to an absorbance of 0.5 (600 

45 nanometers). Cells (0.5 ml) are pelleted and resuspended in 2 ml Z buffer (0.1 M Na3PO*. pH 7.0, 1 mM 
MgS04, 0.2 mM MnSCH, 0.1 M 0-mercaptoethanol, 0.2 mg/ml CTAB (hexadecyltrimethylammonium 
bromide), 0.1 mg/ml sodium deoxychblate), and incubated for five minutes at 28* C. Next, 0.6 ml of 4 mg/ml 
ONPG in 0.1 M NaaPO*. pH 7.0 are added, and incubated at 28 # C until the solution turns yellow (10 
minutes to overnight). To stop the reaction, 1.3 ml of 1 M Na 2 C03 are added. The absorbance is then read 

so at 420 and 550 nanometers. The units of jS-galactosidase are calcuated using the following equation: 

Units » (A 42Q" L75 > X A 55Q 
55 t X v X A 60Q 

where A = absorbance, t = time of reaction in minutes, and v = volume of culture used in ml. 
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The results of the quantitative assay are also shown in Table V. The blue clones exhibit significantly less 0- 
galactosidase activity than the parent plasmid pZM1, ranging, from about 10 fold less (pZM-CIa B) to about 
500 fold less (pZM-Eco). Without being bound by the following, it is proposed that the reduced 0- 
galactosidase activity observed upon insertion of the cleavage cassette may result from a change in the 
5 structure of this protein which attenuates its enzymatic function, or from 0-galactosidase/cleavage cassette 
protein which displays wild-type enzymatic activity, but is present in reduced amounts, or from a 
combination of these two factors. As expected, white clones pZM-Dra and pZM-Sst do not produce 
detectable levels of ^-galactosidase activity in this assay. 

The l-galactosidase/cleavage cassette proteins are next tested for susceptibility to cleavage by HIV 

io protease in vitro . Two samples of E coli strain MC1061, one harboring protease plasmid p1 +IQ and the 
other plasmid p15-IQ, are grown to 0.5 absorbance units (595 nanometers). Protease synthesis is induced 
by addition of IPTQ to 1 mM. After one hour, cells are pelleted, resuspended in 0.66 volumes M buffer (50 
mM 2-(N-morpholino)ethanesulfonic acid (MES), pH 6, 1 mM dithiothreitol, and 25 mM NaCI) containing 1 
mM EDTA and 1 mg/ml lysozyme, and incubated on ice for 10 minutes. Cells are pelleted and resuspended 

75 in 0.01 volume M buffer, and lysed by three freeze/thaw cycles (freezing rapidly in dry ice/ethanol, and 
thawing slowly in ice water). After the final thawing, the extract is centrifuged and the clear supernatant is 
used as a source of protease. 

The same method is used to prepare extracts containing 0-galactosidase from E. coli MC1061 cells 
harboring the pZM plasmid series described previously, except that IPTG is omitted ahdTTl volume of M 

20 buffer is used for the freeze/thaw cycle. For cleavage reactions, 20 ul of jS-galactosidase extract is typically 
incubated with 5 ul of protease extract at 37* for one hour, followed by electrophoresis of incubation 
reactions on SDS-polyacrylamide gels, l-galactosidase is visualized by Western blot analysis using house 
anti-£-galactosidase polyclonal serum (Sigma) and the ImmunoSelect kit (Bethesda Research Laboratories). 
3S S-methionine-labelled control substrate corresponding to the HIV gag polyprotein (nt 109-1417) (4) is 

25 produced from the T7 promoter in E coli strain BL21(DE3)pLysS. 

An extract of E. coli strain MC1Q61 containing wild type protease (from p1+IQ), but not mutant 
protease (from p15-IG), is able to cleave a gag polyprotein which is a natural substrate for HIV protease 
(Rg. 6A). Extracts of E coli harboring jS-galactosidase/cleavage cassette constructs are incubated with 
either wild-type or mutant HIV protease extract, and cleavage of 0-gaiactosidase is monitored by Western 

30 analysis using anti-0-galactosidase antibodies (Rg. 6B). Native /3-galactosidase encoded by pZM1 is 
unaffected by these incubations (lanes 4-6). Of the six 0-galactosidase/cleavage cassette proteins exam- 
ined, five of these (Bss, Cla A, Ora, Eco and Sst) are likewise undigested by HIV protease (Table V), as is 
the Cla B clone containing a reversed decapeptide insertion used as a control. 

In contrast, the 0-gaiactosidasa/cleavage cassette encoded by two independent pZM-Sau clones, is 

35 cleaved by wild type HIV protease (lanes 8 and 11), producing a cleavage product of approximately 100 kd, 
consistent with cleavage at the Saul insertion site located 80 amino acids from the amino terminus. The 
other expected cleavage product of approximately 8 kd is not detected and may be unstable, incubation of 
the pZM-Sau jS-galactosidase/cleavage cassette with mutant HIV protease (Rg. 6B, lanes 9 and 12) fails to 
produce cleavage, indicating that the cleavage event is mediated by HIV protease and not by an E. coli 

40 protease. ~~ 

To further examine the basis for reduced 0-galactosidase activity in the /3-galactosidase/cleavage 
cassette clones, the amount of ^-galactosidase is quantitated by Western analysis, using anti-l-galac- 
tosidase antibody. It is found that the pZM-Sau construct has significantly less ^-galactosidase protein 
compared to the parent pZM1 (Rg. 6B, compare the intensity of the jS-gaiactosidase band in lane 4 with 

45 lanes 7 and 10). This difference is quantitated more accurately by Western analysis of different dilutions of 
cells harboring pZM1 and pZM-Sau. and it is determined that the pZM-Sau clone has about 10-20 fold less 
j9-galactosidase protein than the parent clone pZMl (data not shown). Thus, for pZM-Sau, a reduced 
amount of ^-galactosidase protein with approximately wild-type levels of enzymatic activity, could account 
for the reduction of ^-galactosidase activity observed on X-gal plates and in the ONPG assay (Table V). 

so Western analysis on the other cleavage cassette clones demonstrates similarly reduced levels of £- 
galactosidase protein sufficient to explain their decrease in enzymatic activity. The non-functional jS- 
gaiactosidase proteins encoded by pZM-Dra and pZM-Sst are likewise present in reduced amounts. Since 
the native ^-galactosidase encoded by pZMl and the altered 0-galactosidase/cleavage cassette proteins 
are expressed from identical vectors which differ only by the insertion of the cleavage cassette in the j9- ' 

55 galactosidase gene, it appears that the 0-galactosidase/cleavage cassette proteins are less stable than the 
native protein. 

In another experiment pepstatin A, a known inhibitor of HIV protease (34). is tested as to whether 
cleavage of the ^3-galactosidase/cleavage cassette protein pZM-Sau incubated with HIV protease is inhibited 



17 



EP0 421 109 A2 



by this compound. Pepstatin A at 1 ug/ml completely inhibits the cleavage of the altered 0-gafactosidase, 
further indicating that the cleavage is due to HIV protease (Figure 7). 

The results in vitro demonstrate that with the exception of the pZM-Sau construct, 0- 
galactosidase/cleavage cassette proteins are not cleaved by HIV protease. The cleavage cassette in the 
5 protease-resistant proteins might be buried in the interior of the proteins and so be inaccessible to the 
protease in an in vitro assay. For this reason, it is useful next to introduce HIV protease into the same cell 
as these /3-galactosidase/cleavage cassette proteins, to allow for co-translational cleavage in vivo . 

To test for cleavage of the 0-galactosidase/cleavage cassette encoded by pZM-Sau invivoTthis altered 
/S-galactosidase gene is subcloned Into the plasmids p1+IQ and p15-IQ, which encode wild type and 
ro mutant HIV protease, respectively. Specifically, the altered /3-galactostdase gene from pZM-Sau is excised 
as a Stul-BamHI fragment and ligated into the Bam HI site of pi -HQ and p15-IQ, creating plasmids 
p1 +IQSau and p15-IQSau, respectively. Samples of an E. coli strain MC1061 harboring plasmld pt +tQSau 
have been deposited with the American Type Culture Collection and have been assigned ATCC accession 
number 68,352. 

rs For comparison purposes, the £-gaIactosidase/cleavage cassette regions of pZM-Bss, pZM-CIa A, and 
pZM-Eco are also subcloned into p1 +IQ. The altered 0-gaIactosidase produced by pZM-CIa B is used as a 
control. Since pZM-Dra and pZM-Sst produce non-functional 0-galactosidase, these will not be considered 
further. 

Total protein from cells harboring these resultant plasmids p1 +IQSau and p15-IQSau is subjected to 
20 SDS-PAGE, and cleavage of 0-galactosidase is monitored by Western analysis (Fig. 8). This 0- 
galactosidase/cleavage cassette is cleaved in vivo by wild type HIV protease (panel A, lane 1, and panel B, 
lane 2), but not by mutant protease (panel B, lane 3), generating a cleavage product of the same size as 
that observed in vitro (Fig. 6B, lanes 8 and 11). It is estimated that approximately 50-75% of the 0- 
galactosidase/cleavage cassette protein is cleaved. These cleavage assays are done in liquid culture. The 
25 amount of cleavage which occurs in colonies is not directly quantitated, but E. coli MC1061 harboring 
p1 +IQSau are white on X-gal plates, whereas p15-IQSau confers blue color. These date are consistent with 
cleavage of the jS-galactosidase/cleavage cassette by wild type, but not by mutant, HIV protease. 

HIV protease does not cleave any of these other ^-galactosidase/cleavage cassette proteins in vivo that 
are protease-resistant in vitro (Rg. 8A t lanes 2-5, and Table V). As expected, cells harboring pi +lQBss, 
30 pi +IQCIa A, p1 +IQCIa B, or p1 +IQEco are blue on X-gal plates, consistent with lack of cleavage by HIV 
protease. 

The results of another HIV protease in vivo cleavage assay are shown in the photograph of Figure 9. 
Samples of E. coli strain MC1061 harboring various plasmids are spotted onto plates containing the color 
indicator X-gal. The plasmids are pZM-1 (a positive control for 0-galactosidase activity). pZM-2 (a negative 

35 control for 0-galactosidase activity), pZM-Sau. p1+ IQSau (labelled 1+IQB-Sau) and p15-IQSau (labelled 
p15-IQB-Sau). In the cleavage assay with wild-type HIV protease, constructs which cleave 0-galactosidase 
produce a white color, while constructs which do not cleave 0-galactosidase produce a blue color. Plasmid 
pZM-2 encodes mutant, inactive 0-galactosidase, and so produces white colonies. Plasmid pZM-2 is 
constructed by digesting plasmid pZM-1 with EcoRI and BamHI to delete the DNA sequence coding for the 

40 18 amino acids at the carboxy-terminal end of 0-gaIactosidase. As expected. pZM-1 produces a blue color, 
as do pZMSau and p15-IQSau. Plasmid pZM-2, as a negative control, produces a white colon 0- 
galactosidase produced by pi + IQSau is cleaved by the protease and produces a white color. 

The 0-galactosidase protein which has been modified by insertion of a viral protease cleavage cassette, 
can be used in a screen to identify inhibitors of that protease, or of any other protease whose cleavage 

45 sequence is known and can be introduced into the Saul site of l-galactosidase. 

The plasmid p1 + IQSau in an appropriate E. coli strain, such as BAS849 (41) (which has a mutation in 
the outer membrane, rendering it more permeable to various compounds), is plated as a lawn onto an agar 
plate. Alternatively, a liquid culture may be used as a nutrient medium. The plate should contain sufficient 
IPTG to induce HIV protease synthesis, cleaving 0-galactosidase and rendering the lawn colorless when 

so assayed with j3-galactosidase color indicators, such as X-gal (40), Fast Blue RR (42) or others. Possible 
inhibitors of the protease are spotted onto the lawn, which is then overlayed with the indicator. 

A protease inhibitor will inactivate the protease, preventing cleavage of 0-galactosidase. The 0- 
galactosidase then reacts with the indicator, causing the spot to become colored (blue in the case of X-Gal, 
red in the case of Fast Blue RR). Thus, protease inhibitors are identified. 

55 A similar screening method may be used where a light-emitting reaction is monitored, rather than a 
color change. The cleavage cassette is inserted into a protein such as luciferase (Clontech Laboratories, 
Inc., Palo Alto, CA), rather than 0-galactosidase. In the absence of a protease enzyme, the 
luciferase/cleavage cassette protein catalyzes the light-emitting chemical reaction. In the presence of 
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luciferin and ATP, an enzyme-bound luciferyl-adenylate complex is formed, which is followed by oxidative 
decarboxylation. The reaction products are CO2. oxy luciferin and AMP, together with the emission of light 
The light emission may be observed with the naked eye. measured spectrophotometrically or recorded on 
X-ray film. 

5 When a protease enzyme is present, it cleaves at the protease enzyme cleavage site, thereby blocking 
the reaction catalyzed by luciferase, such that no light is emitted. In a screen for protease inhibitors, a test 
compound is included in the assay system. If the compound prevents cleavage by the protease, the 
reaction proceeds and light is emitted, thereby identifying the compound as a protease inhibitor. 

As described above, in both the color indicator and light-emitting reaction in vivo assays, the expression 
10 system contains genes coding for the /9-galactosidase/cleavage cassette protein and HIV protease. The 
expression system expresses both proteins used in the assay system. 

The 0-galactosidase/cleavage cassette protein can also be assayed directly in a test tube. In this in 
vitro assay, the expression system contains only the gene coding for the 0-galactosidase/cleavage cassette 
protein. The HIV protease is generated separately and kept apart from the fusion protein until the time of 
75 the assay. At that time, the purified 0-galactosidase/cleavage cassette protein is mixed with HIV protease 
and a potential protease inhibitor in solution. The 0-galactosidase activity is monitored by hydrolysis of 
ONPG, using a spectrophotometer. A protease inhibitor will increase l-galactosidase activity over that 
observed in a control experiment with only the fusion protein and HIV protease. 

The foregoing screening methods of the second embodiment of this invention may also be used to 
20 screen for inhibitors of other proteases, such as polio 3C protease. First, a 0-galactosidase/cleavage 
cassette protein is constructed. For example, an oligonucleotide encoding the polio 3C protease cleavage 
site Met-Qlu-Ala-Leu-Phe-Gln-Gly-Pro-Leu-Gln-Tyr-Lys-Asp (43) is inserted, in frame, into the Saul site of 0- 
galactosidase, creating pZM3CSau. Cells harboring this plasmid are blue, consistent with results for pZM- 
Sau, which contains the HIV protease cleavage cassette (Table V). These data further indicate that the Saul 
25 site is an appropriate site for insertion of heterologous protein sequences without destroying jS-galac- 
tosidase activity. 

As shown in Figure 10, this fusion protein is cleaved by polio 3C protease (lane 5), which is expressed 
in BL21(DE3)pLysS in similar fashion as Nicklin et al. (44), but not by HIV protease (lane 6), as expected. 
This construct may then be used to screen for inhibitors of polio 3C protease, using the methods described 
30 previously. 
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Claims 

30 1. A method for screening for compounds which inhibit protease enzymes, which comprises the steps of: 

(a) constructing a plasmid having a ONA sequence which encodes a protease under the control of a 17 
promoter; 

(b) selecting a bacterial expression strain which is not transformed by said plasmid because of the 
toxicity of the protease to the bacterial strain; and 

35 (c) adding a compound to be screened together with the plasmid to the bacterial strain such that, if the 
bacterial strain is not killed and produces transformants. the compound is identified as an inhibitor of the 
protease. 

2. The method of claim 1, wherein the bacterial strain is E. coli BL21(DE3) and the plasmid has a DNA 
sequence which encodes human immunodeficiency virus protease. 
40 3. The method of claim 1, wherein the plasmid is pPolio protease 3C and the plasmid has a DNA sequence 
which encodes polio protease 3C. 

4. A method for screening for compounds which inhibit human immunodeficiency virus protease, which 
comprises the steps of: 

(a) constructing a plasmid having a DNA sequence which encodes human immunodeficiency virus 
45 protease under the control of a T7 promoter; 

(b) selecting a bacterial strain transformed by said plasmid, which strain is killed by the induction of 
protease synthesis by the inducing agent isopropyl-j8-D-thiogalactopyranside; and 

(c) adding a compound to be screened together with isopropyl-0-D-thtogalactopyranoside to the trans- 
formed bacterial strain such that, if the induced cells are not killed, the compound is identified as an 

so inhibitor of the human immunodeficiency virus protease. 

5. The method of claim 4, wherein the bacterial strain is E. coli BL21(DE3)pLysS. 

6. A method of preventing the expression of enzymatically active 10 kd human immunodeficiency virus 
protease, which comprises: (a) introducing a mutation into the DNA sequence which encodes the 18 kd 
protease precursor of the 10 kd protease, thereby preventing autodigestion of the precursor which would 

55 otherwise liberate the mature, enzymatically active 10 kd protease; or (b) introducing a mutation into the 
DNA sequence which encodes the 10 kd protease. 

7. A method of producing non-functional 10 kd human immunodeficiency virus protease, which comprises 
modifying the DNA sequence which encodes the 18 kd protease precursor of the 10 kd protease, such that 
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the scissile Phe-Pro bond located at amino acids -1 to 1 of the mature 10 kd protease is disrupted. 

8. A method for screening for compounds which inhibit protease enzymes, which comprises the steps of: 
(I) 

(a) selecting a reporter protein which is capable of being assayed by reacting with a color indicator 
compound to produce a color change; 

(b) inserting into the gene coding for said protein a heterologous DNA sequence coding for a peptide 
which includes a protease enzyme cleavage site; 

(c) inserting into a suitable expression system the construct of step (b) and a gene coding for a 
protease enzyme which recognizes and cleaves the protease enzyme cleavage site of the peptide 
coded for by the heterologous DNA sequence of step (b); 

(d) preparing an assay system which comprises: 

(i) the expression system of step (c); 

(ii) a color indicator compound; and 

(iii) a nutrient medium; and 

(e) adding a compound to be screened to the assay system, such that if the compound inhibits the 
protease enzyme, the expression system produces an intact construct of step (b), the reporter protein 
retains its activity and color is maintained in the assay system, but if the compound is not an inhibitor, 
the protease cleaves the protease enzyme cleavage site of the peptide of step (b), the reporter protein 
is inactivated and no color is observed in the assay system; or 

(») 

(a) selecting a reporter protein which is capable of catalyzing a light-emitting chemical reaction; 

(b) inserting into the gene coding for said reporter protein a heterologous DNA sequence coding for a 
peptide which includes a protease enzyme cleavage site; 

(c) inserting into a suitable expression system the construct of step (b) and a gene coding for a 
protease enzyme which recognizes and cleaves the protease enzyme cleavage site of the peptide 
coded for by the heterologous DNA sequence of step (b); 

(d) preparing an assay system which comprises: 

(i) the expression system of step (c); and 

(ii) a compound which undergoes a light-emitting chemical reaction catalyzed by the reporter 
protein of step (a); and 

(e) adding a compound to be screened to the assay system, such that, if the compound inhibits the 
protease enzyme, the expression system produces an intact construct of step (b), the reporter protein, 
retains its catalytic activity and a light-emitting chemical reaction occurs, but if the compound is not 
an inhibitor, the protease cleaves the protease enzyme cleavage site of the peptide of step (b), the 
reporter protein is inactivated and the light-emitting chemical reaction does not occur. 

9. A method of claim 8, wherein in (I) the reporter protein of step (a) is 0-galactosidase, in (II) the reporter 
protein of step (a) is luciferase; and in either (I) or (II), the protease cleavage site of step (b) is the human 
immunodeficiency virus p6/PR gag /pol polyprotein cleavage site, the expression system of step (c) 
includes an E . coli host strain, and the protease enzyme of step (d)(i) is the human immunodeficiency virus 
protease. ~ 

10. A method of claim 8, wherein in either (I) or (II) the protease cleavage site of step (b) is a polio 3C 
protease cleavage site and the protease enzyme of step (d)(i) is polio 3C protease. 

1 1 . A method for screening for compounds which inhibit protease enzymes, which comprises the steps of: 

(a) selecting a reporter protein whose activity is capable of being monitored by reacting with a substrate; 

(b) inserting into the gene coding for said reporter protein a heterologous DNA sequence coding for a 
peptide which includes a protease enzyme cleavage site; 

(c) inserting into a suitable expression system the construct of step (b); 

(d) preparing an assay system which comprises: 

(i) a protease enzyme which recognizes and cleaves the protease enzyme cleavage site of the 
peptide coded for by the heterologous DNA sequence of step (b); 

(ii) the expression system of step (c) harboring the construct of step (b); and 

(iii) a substrate which reacts with the reporter protein of step (a); and 

(e) adding a compound to be screened to the assay system, such that, if the compound inhibits the 
protease enzyme, the expression system produces an intact construct of step (b), and the reporter 
protein retains its activity as monitored by reaction with the substrate, but if the compound is not an 
inhibitor, the protease cleaves the protease enzyme cleavage site of the peptide of step (b), and the 
reporter protein is inactivated. 

12. A method of claim 11, wherein the protein of step (a) is 0-galactosidase. the protease cleavage site of 
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step (b) is the human immunodeficiency virus p6/PR gag /pol polyprotein cleavage site, the expression 
system of step (c) includes an E. colt host strain, thI~protease enzyme of step (d)(i) is the human 
immunodeficiency virus protease, and the substrate of step (d)(iii) is O-nitrophenyt-l-D-galactoside. 

13. A method of clam 11, wherein the protease cleavage site of step (b) is a polio 3C protease cleavage 
5 site and the protease enzyme of step (d)(i) is polio 3C protease. 

14. A plasmid selected from the group consisting of plasmids designated pPDB16, pMae2, p1 +IQSau and 
pZM3C$au. 

15. A bacterial strain harboring a plasmid selected from the group consisting of E. coli strain DH5a 
harboring plasmid pPDBl6 (ATCC 68005), E coli strain BL21(DE3) harboring plasmid pP48"(ATCC 68004) 

io and E. coli strain MC1061 harboring plasmid p1 +IQSau (ATCC 68,352). 
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SEQUENCE LISTING 
INFORMATION FOR SEQ ID NO: It 
SEQUENCE TYPE: Oligonucleotide 
SEQUENCE LENGTH: €0 nucleotides 

FRAGMENT TYFBs Carboxy terminus of 0-galactosidase plus 
two atop codons 

AAT TCC AGC TGA GCG CCG GTC GCT ACC ATT ACC AGT TGG TCT 42 
GG* GTC AAA AAT AAT AAG 60 

INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE TYPE: Oligonucleotide 
SEQUENCE LENGTH: 30 nucleotides 

FRAGMENT TYPE: Internal coding region of the HIV gag/po^ 
polyprotein 

FEATURES: Codes for a decapeptide corresponding to the 
p6/PR cleavage site of the HIV gag/sol polyprotein 

GTA AGC TTT AAC TTC CCT CAG ATC ACT CTG 30 

Val Ser Phe Asn Phs Pro Gin lie Thr Leu 

IS 10 
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INFORMATION FOR SEQ ID NOt 3 1 
SEQUENCE TYPE: Oligonucleotide 
SEQUENCE LENGTH: 36 nucleotides 

FRAGMENT TYPEs Modified internal coding region of the KZV 
SOSZBfil polyprotein 

FEATURES i Codes for a decapeptide corresponding to the 
p6/PR oleavage site of the HIV crag/pol polyprotein, and 
oontains a cohesive end for the EcoRI restriction site of 
the £« fiflil j-galactosidase gene of plasmid pZMl. 



AATTT GTA AGC TTT AAC TTC OCT CAG ATC ACT CTG C 36 



INFORMATION FOR SEQ 10 NO; 4) 
SEQUENCE TYPES Oligonucleotide 
SEQUENCE LENGTH: 36 nucleotides 

FRAGMENT TYPE: Modified internal coding region of the HIV 
gap/poi polyprotein 

FEATURES: Codes for a decapeptide corresponding to the 
p6/FR oleavage site of the HIV aaa/eol polyprotein, and 
contains a cohesive end for the SstI restriction sits of 
the £. ooli j-galactosidase gene of plasmid pZMl. 
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INFORMATION FOR SEQ ID NOt 3: 
SEQUENCE TYPE: Oligonucleotide 
SEQUENCE LENGTH: 36 nucleotide 

FRAGMENT TYPE: Modified internal coding region of ths HIV 
SftS^Bfil polyprotein 

FEATURES i Codes for a decapeptide corresponding to the 
p«/PR cleavage sits of the HIV gag/pol polyprotein, and 
contains a cohesive end for the EcoRi restriction site of 
the S« poli £-galactosidase gene of plasmid pZMl. 

AATTT OTA AGC TTT AAC TTC CCT CAG ATC ACT CTG C 36 



INFORMATION FOR SEQ ZD NO: 4* 
SEQUENCE TYPE: Oligonucleotide 
SEQUENCE LENGTH: 36 nucleotides 

FRAGMENT TYPE: Modified internal coding rsgion of the HIV 
? ao/pol polyprotein 

FEATURES t Codes for a decapeptide corresponding to ths 
p6/PR cleavage site of the HIV gftsZpal polyprotein, and 
contains a cohesive end for the SstI restriction site of 
the fi. ogli l-galactosidase gene of plasmid pZMl. 
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0 GTA AGC TTT AAC TTC'CCT CAG ATC ACT CTG CAGCT 36 
INFORMATION FOR SEQ ID NO; 5s 
SEQUENCE TYPES Oligonucleotide 
SEQUENCE LENGTH: 30 nucleotides 

FRAGMENT TYPE i Modified internal coding region of the 
HIV qaa/ttol polyprotein 

FEATURES : Codes for a decapeptide corresponding to the 
p6/PR cleavage site of the HIV yaa/pol polyprotein, and 
contains a cohesive end for the Clal restriction site of 
the £• colt i-galactosidaee gene of plasmid p2Ml. 

C GTA AGC TTT AAC TTC CCT CAG ATC ACT CT 30 



INFORMATION FOR SEQ ID NO l 6: 
SEQUENCE TYPE: Oligonucleotide 
SEQUENCE LENGTHS 30 nucleotides 



FRAGMENT TYPE « Modified internal coding region of the HIV 
gagr/pol polyprotein 

FEATURES: Codes for a decapeptide corresponding to the 
p6/FR cleavage site of the HIV oao/pol polyprotein, and 
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contains a cohesive end for the Drain restriction site of 
the £. sail 0-galactosidaae gene of plaeaid pZKl. 

C OTA ASC ITT AAC TTC CCT CAO ATT ACG CT 30 

INFORMATION FOR SEQ ID NO: 7; 
SEQUENCE TYPES Oligonucleotide 
SEQUENCE LENGTH: 36 nucleotides 

FRAGMENT TYPE: Modified internal coding region of the HIV 
qa g/pol polyprotein 

FEATURES t Codes for a dscapsptide corresponding to the 
p6/PR oleavage site of the HIV fao/pol polyprotein, and 
contains a cohesive end for the Saul restriction site of 
the £• coli l-galaotosidase gene of plasaid pZMl. 

TQAA GTA AGO TTT AAC TTC CCT CAG ATT ACQ CTG AC 36 
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INFORMATION FOR SEQ ID HOt 8t 
SEQUENCE TVPSt Oligonucleotide 
SEQUENCE LENGTH i 36 nucleotide* 

FRAGMENT TYPE i Modified internal coding region of the HIV 
g^g/pol polyprotein 

FEATURES: Codes for a decapeptide corresponding to the 
p«/PR cleavage site of the HIV aasZRfll polyprotein, and 
contains a cohesive end for the BssKS restriction site of 
the fi* foli 0-galactosidase gene of plasmid pZMl. 

CGCGA GTA AGC TTT AAC TTC CCT CAO ATT ACQ CTG A 36 



INFORMATION FOR SEQ ID NO; 91 
SEQUENCE TYPE: Peptide 
SEQUENCE LENGTH: 10 amino acids 

FRAGMENT TYPE: Encoded by cleavage cassette in reverse 
orientation of plasmid pZM-ClaB 

FEATURES: Not recognized by HIV protease 

Glu ser Asp Leu Arg Giu Val Lys Ala Tyr 
15 10 
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INFORMATION FOR SEQ ID NO J 10 

SEQUENCE TYPE: Paptida 

SEQUENCE LENGTH: 13 amino aoida 

FRAGMENT TYPE: Polio 3C protaaaa claav»«» site 

Mat Olu Ala &au Pba Gin Gly Pro Lau Gin Tyr Lya Aap 

15 10 
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FIG. 2A FIG. 2B 
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FIG. 4 
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FIG. 5B 
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FIG. 7 
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FIG. 8A 



FIG. 8B 
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FIG. 9 
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© Screening methods for protease inhibitors. 



CO 
< 



© The invention provides methods to screen for 
inhibitors of proteases. The first method involves the 
construction of a plasmid system such that expres- 
sion of a protease is toxic to a bacterial strain. 
Compounds of interest may then be screened for 
those which inhibit the toxic protease and thereby 
prevent killing of the bacterial strain. Inhibition is 
detected by successful transformations of the bac- 
terial strain by the plasmid coding for the protease. 



nucleotide: 1 
amino acid: - 69 
Met 



The second method involves the construction of a 
fusion protein which contains a protease cleavage 
site inserted into a protein whose activity may be 
assayed. Compounds are screened for protease in- 
hibition, which is detected by assaying for a change 
in color, light or activity of the protein. HIV protease 
and polio 3C protease are examples of proteases 
that may be used in these methods. 
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